Segmentation-free word recognition with application to Arabic
Abstract
This paper describes the design and implementation of a system that recognizes machine-printed Arabic words without prior segmentation. The technique is based on describing symbols in terms of shape primitives. At recognition time, the primitives are detected on a word image using mathematical morphology operations. The system then matches the detected primitives with symbol models, which yields a spatial arrangement of matched symbol models. The system searches the space of spatial arrangements of models and outputs the arrangement with the highest posterior probability as the recognition of the word. The advantage of this whole-word approach over a segmentation approach is that the recognition result is optimized with regard to the whole word. Results of preliminary experiments using a lexicon of @,000 words show a recognition rate of 99.4% for noise-free text and 73% for scanned text.
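The primitive-detection step can be illustrated with binary erosion, one of the basic mathematical morphology operations. The following is a minimal sketch only: the abstract does not say which primitives or operators the system uses, so the `erode` helper, the `VERTICAL_STROKE` structuring element, and the toy `WORD_IMAGE` are all hypothetical and chosen purely for illustration.

```python
def erode(image, element):
    """Binary erosion on nested lists of 0/1.

    Returns the (row, col) origins at which every foreground pixel of
    `element` lands on a foreground pixel of `image`.
    """
    rows, cols = len(image), len(image[0])
    e_rows, e_cols = len(element), len(element[0])
    # Offsets of the foreground pixels inside the structuring element.
    offsets = [(r, c) for r in range(e_rows) for c in range(e_cols)
               if element[r][c]]
    hits = []
    for r in range(rows - e_rows + 1):
        for c in range(cols - e_cols + 1):
            if all(image[r + dr][c + dc] for dr, dc in offsets):
                hits.append((r, c))
    return hits


# Hypothetical shape primitive: a vertical stroke three pixels tall.
VERTICAL_STROKE = [[1], [1], [1]]

# Toy binary "word image" (1 = ink); not taken from the paper.
WORD_IMAGE = [
    [0, 1, 0, 0, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [1, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

# Positions where the stroke primitive fits entirely inside the ink.
stroke_positions = erode(WORD_IMAGE, VERTICAL_STROKE)
print(stroke_positions)
```

In a full system, detections like these would feed the later stages the abstract describes: matching against symbol models, then searching for the arrangement with the highest posterior probability. Neither stage is shown here.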
Related resources
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
Multi-Font Arabic Word Recognition Using Spectral Features
In this paper we present a new technique for recognising Arabic cursive words from scanned images of text. The approach is segmentation-free, and is applied to four different Arabic typefaces, where ligatures and overlaps pose challenges to segmentation-based methods. We transform each word into a normalised polar image, then we apply a two dimensional Fourier transform to the polar image. The ...
Region growing based segmentation algorithm for typewritten and handwritten text recognition
This paper presents a new technique of high accuracy to recognize both typewritten and handwritten English and Arabic texts without thinning. After segmenting the text into lines (horizontal segmentation) and the lines into words, it separates the word into its letters. Separating a text line (row) into words and a word into letters is performed by using the region growing technique (implicit s...
An Arabic optical character recognition system using recognition-based segmentation
Optical character recognition (OCR) systems improve human-machine interaction and are widely used in many areas. The recognition of cursive scripts is a difficult task as their segmentation suffers from serious problems. This paper proposes an Arabic OCR system, which uses a recognition-based segmentation technique to overcome the classical segmentation problems. A newly developed Arabic word segm...
Word-level recognition of multifont Arabic text using a feature vector matching approach
Many text recognition systems recognize text imagery at the character level and assemble words from the recognized characters. An alternative approach is to recognize text imagery at the word level, without analyzing individual characters. This approach avoids the problem of individual character segmentation, and can overcome local errors in character recognition. A word-level recognition syste...
Publication date: 1995